CLAIMS 

1 1. (previously presented) A method for synthesizing an auditory scene, comprising: 

2 processing at least one input channel to generate two or more processed input signals; 

3 filtering the at least one input channel to generate two or more diffuse signals; and 

4 combining the two or more diffuse signals with the two or more processed input signals to 

5 generate a plurality of output channels for the auditory scene, wherein processing the at least one input 

6 channel comprises: 

7 converting the at least one input channel from a time domain into a frequency domain to 

8 generate a plurality of frequency-domain (FD) input signals; 

9 delaying the FD input signals to generate a plurality of delayed FD signals; and 

10 scaling the delayed FD signals to generate a plurality of scaled, delayed FD signals. 

1 2. (canceled) 

1 3. (previously presented) The invention of claim 1, wherein: 

2 the FD input signals are delayed based on inter-channel time difference (ICTD) data; and 

3 the delayed FD signals are scaled based on inter-channel level difference (ICLD) and inter- 

4 channel correlation (ICC) data. 

1 4. (original) The invention of claim 3, wherein: 

2 the at least one input channel is at least one combined channel generated by performing binaural 

3 cue coding (BCC) on an original auditory scene; and 

4 the ICTD, ICLD, and ICC data are cue codes derived during the BCC coding of the original 

5 auditory scene. 

1 5. (original) The invention of claim 4, wherein the at least one combined channel and the 

2 cue codes are transmitted from an audio encoder that performs the BCC coding of the original auditory 

3 scene. 

1 6. (original) The invention of claim 3, wherein different ICTD, ICLD, and ICC data are 

2 applied to different frequency sub-bands of the corresponding FD signals. 

1 7. (previously presented) The invention of claim 1, wherein: 

2 the diffuse signals are FD signals; and 
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3 the combining comprises, for each output channel: 

4 summing one of the scaled, delayed FD signals and a corresponding one of the FD 

5 diffuse input signals to generate an FD output signal; and 

6 converting the FD output signal from the frequency domain into the time domain to 

7 generate the output channel. 

1 8. (original) The invention of claim 7, wherein filtering the at least one input channel 

2 comprises: 

3 applying two or more late reverberation filters to the at least one input channel to generate a 

4 plurality of diffuse channels; 

5 converting the diffuse channels from the time domain into the frequency domain to generate a 

6 plurality of FD diffuse signals; and 

7 scaling the FD diffuse signals to generate a plurality of scaled FD diffuse signals, wherein the 

8 scaled FD diffuse signals are combined with the scaled, delayed FD input signals to generate the FD 

9 output signals. 

1 9. (original) The invention of claim 8, wherein: 

2 the FD diffuse signals are scaled based on ICLD and ICC data; 

3 the at least one input channel is at least one combined channel generated by performing BCC 

4 coding on an original auditory scene; and 

5 the ICLD and ICC data are cue codes derived during the BCC coding of the original auditory 

6 scene. 

1 10. (original) The invention of claim 9, wherein the at least one combined channel and the 

2 cue codes are transmitted from an audio encoder that performs the BCC coding of the original auditory 

3 scene. 

1 11. (original) The invention of claim 9, wherein different ICLD and ICC data are applied to 

2 different frequency sub-bands of the corresponding FD signals. 

1 12. (original) The invention of claim 7, wherein filtering the at least one input channel 

2 comprises: 

3 applying two or more FD late reverberation filters to the FD input signals to generate a plurality 

4 of diffuse FD signals; and 
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5 scaling the diffuse FD signals to generate a plurality of scaled diffuse FD signals, wherein the 

6 scaled diffuse FD signals are combined with the scaled, delayed FD input signals to generate the FD 

7 output signals. 

1 13. (original) The invention of claim 12, wherein: 

2 the diffuse FD signals are scaled based on ICLD and ICC data; 

3 the at least one input channel is at least one combined channel generated by performing BCC 

4 coding on an original auditory scene; and 

5 the ICLD and ICC data are cue codes derived during the BCC coding of the original auditory 

6 scene. 

1 14. (original) The invention of claim 13, wherein different ICLD and ICC data are applied 

2 to different frequency sub-bands of the corresponding FD signals. 

1 15. (previously presented) The invention of claim 1 , wherein the method generates more 

2 than two output channels from the at least one input channel. 

1 16. (original) The invention of claim 15, wherein the method synthesizes a surround sound 

2 auditory scene. 

1 17. (original) The invention of claim 15, wherein a single input channel is used to 

2 synthesize the auditory scene. 

1 18. (original) The invention of claim 1, wherein: 

2 the method applies the processing, filtering, and combining for input channel frequencies less 

3 than a specified threshold frequency; and 

4 the method further applies alternative auditory scene synthesis processing for input channel 

5 frequencies greater than the specified threshold frequency. 

1 19. (original) The invention of claim 18, wherein the alternative auditory scene synthesis 

2 processing involves coherence-based BCC coding without the filtering that is applied to the input 

3 channel frequencies less than the specified threshold frequency. 

1 20. (canceled) 
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1 21. (currently amended) Apparatus for synthesizing an auditory scene, comprising: 

2 a configuration of at least one time domain to frequency domain (TD-FD) converter and a 

3 plurality of filters, the configuration adapted to generate two or more processed FD input signals and two 

4 or more diffuse FD signals from at least one TD input channel; 

5 two or more combiners adapted to combine the two or more diffuse FD signals with the two or 

6 more processed FD input signals to generate a plurality of synthesized FD signals; and 

7 two or more frequency domain to time domain (FD-TD) converters adapted to convert the 

8 synthesized FD signals into a plurality of TD output channels for the auditory scene, wherein: 

9 the configuration comprises: 

10 a first TD-FD converter adapted to convert the at least one TD input channel into 

11 a plurality of FD input signals; 

12 a plurality of delay nodes adapted to delay the FD input signals to generate a 

13 plurality of delayed FD signals; and 

14 a plurality of multipliers adapted to scale the delayed FD signals to generate a 

15 plurality of scaled, delayed FD signals; and 

16 the combiners are adapted to sum, for each output channel, one of the scaled, delayed FD 

1 7 signals and a corresponding one of the diffuse FD signals to generate one of the synthesized FD signal 

18 wherein the apparatus is adapted to generate more than two output channels from the at least one TD 

19 input channel . 

1 22. (canceled) 

1 23. (previously presented) The invention of claim 21, wherein: 

2 the delay nodes are adapted to delay the FD input signals based on inter-channel time difference 

3 (ICTD) data; and 

4 the multipliers are adapted to scale the delayed FD signals based on inter-channel level 

5 difference (ICLD) and inter-channel correlation (ICC) data. 

1 24. (original) The invention of claim 23, wherein: 

2 the at least one input channel is at least one combined channel generated by performing binaural 

3 cue coding (BCC) on an original auditory scene; and 

4 the ICTD, ICLD, and ICC data are cue codes derived during the BCC coding of the original 

5 auditory scene. 
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1 25. (original) The invention of claim 23, wherein the configuration is adapted to apply 

2 different ICTD, ICLD, and ICC data to different frequency sub-bands of the corresponding FD signals. 

1 26. (canceled) 

1 27. (previously presented) The invention of claim 21, wherein 

2 each filter is a TD late reverberation filter adapted to generate a different TD diffuse channel 

3 from the at least one TD input channel; 

4 the configuration comprises, for each output channel in the auditory scene: 

5 another TD-FD converter adapted to convert a corresponding TD diffuse channel into an 

6 FD diffuse signal; and 

7 an other multiplier adapted to scale the FD diffuse signal to generate a scaled FD diffuse 

8 signal, wherein a corresponding combiner is adapted to combine the scaled FD diffuse signal with a 

9 corresponding one of the scaled, delayed FD signals to generate one of the synthesized FD signals. 

1 28. (original) The invention of claim 27, wherein: 

2 each other multiplier is adapted to scale the FD diffuse signal based on ICLD and ICC data; 

3 the at least one input channel is at least one combined channel generated by performing BCC 

4 coding on an original auditory scene; and 

5 the ICLD and ICC data are cue codes derived during the BCC coding of the original auditory 

6 scene. 

1 29. (original) The invention of claim 28, wherein the configuration applies different ICLD 

2 and ICC data to different frequency sub-bands of the corresponding FD signals. 

1 30. (previously presented) The invention of claim 21, wherein: 

2 each filter is an FD late reverberation filter adapted to generate a different FD diffuse signal from 

3 one of the FD input signals; and 

4 the configuration further comprises a further plurality of multipliers adapted to scale the FD 

5 diffuse signals to generate a plurality of scaled FD diffuse signals, wherein the combiners are adapted to 

6 combine the scaled FD diffuse signals with the scaled, delayed FD signals to generate the synthesized FD 

7 signals. 
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1 31. (original) The invention of claim 30, wherein at least two FD late reverberation filters 

2 have different filter lengths. 

1 32. (original) The invention of claim 30, wherein: 

2 the FD diffuse signals are scaled based on ICLD and ICC data; 

3 the at least one input channel is at least one combined channel generated by performing BCC 

4 coding on an original auditory scene; and 

5 the ICLD and ICC data are cue codes derived during the BCC coding of the original auditory 

6 scene. 

1 33. (original) The invention of claim 32, wherein the configuration applies different ICLD 

2 and ICC data to different frequency sub-bands of the corresponding FD signals. 

1 34. (canceled) 

1 35. (currently amended) The invention of claim [[34]] 21, wherein the apparatus is adapted 

2 to synthesize a surround sound auditory scene. 

1 36. (currently amended) The invention of claim [[34]] 2J_, wherein the apparatus is adapted 

2 to use a single input channel to synthesize the auditory scene. 

1 37. (original) The invention of claim 21, wherein the apparatus comprises one filter for 

2 every output channel in the auditory scene. 

1 38. (currently amended) The invention of claim 21, wherein each filter has a substantially 

2 random frequency response with a substantially flat spectral envelope. 

1 39. (original) The invention of claim 21, wherein: 

2 the apparatus is adapted to generate, combine, and convert for TD input channel frequencies less 

3 than a specified threshold frequency; and 

4 the apparatus is further adapted to apply alternative auditory scene synthesis processing for TD 

5 input channel frequencies greater than the specified threshold frequency. 
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1 40. (original) The invention of claim 39, wherein the alternative auditory scene synthesis 

2 processing involves coherence-based BCC coding without the filters that are applied to the TD input 

3 channel frequencies less than the specified threshold frequency. 

1 41-42. (canceled) 

1 42. (previously presented) The invention of claim 41, wherein a single input channel is used 

2 to synthesize the auditory scene. 

1 43. (previously presented) A method for synthesizing an auditory scene, comprising: 

2 processing at least one input channel to generate two or more processed input signals; 

3 filtering the at least one input channel to generate two or more diffuse signals; and 

4 combining the two or more diffuse signals with the two or more processed input signals to 

5 generate a plurality of output channels for the auditory scene, wherein: 

6 the method applies the processing, filtering, and combining for input channel frequencies 

7 less than a specified threshold frequency; and 

8 the method further applies alternative auditory scene synthesis processing for input 

9 channel frequencies greater than the specified threshold frequency. 

1 44. (previously presented) The invention of claim 43, wherein the alternative auditory scene 

2 synthesis processing involves coherence-based BCC coding without the filtering that is applied to the 

3 input channel frequencies less than the specified threshold frequency. 

1 45. (previously presented) Apparatus for synthesizing an auditory scene, comprising: 

2 a configuration of at least one time domain to frequency domain (TD-FD) converter and a 

3 plurality of filters, the configuration adapted to generate two or more processed FD input signals and two 

4 or more diffuse FD signals from at least one TD input channel; 

5 two or more combiners adapted to combine the two or more diffuse FD signals with the two or 

6 more processed FD input signals to generate a plurality of synthesized FD signals; and 

7 two or more frequency domain to time domain (FD-TD) converters adapted to convert the 

8 synthesized FD signals into a plurality of TD output channels for the auditory scene, wherein: 

9 the configuration comprises: 

10 a first TD-FD converter adapted to convert the at least one TD input channel into 

11 a plurality of FD input signals; 



Serial No. 10/815,591 



Baumgarte 7-12 (992.1130) 



12 a plurality of delay nodes adapted to delay the FD input signals to generate a 

13 plurality of delayed FD signals; and 

14 a plurality of multipliers adapted to scale the delayed FD signals to generate a 

15 plurality of scaled, delayed FD signals; 

16 the delay nodes are adapted to delay the FD input signals based on inter-channel time 

17 difference (ICTD) data; and 

1 8 the multipliers are adapted to scale the delayed FD signals based on inter-channel level 

19 difference (ICLD) and inter-channel correlation (ICC) data. 

1 46. (previously presented) The invention of claim 45, wherein: 

2 the at least one input channel is at least one combined channel generated by performing binaural 

3 cue coding (BCC) on an original auditory scene; and 

4 the ICTD, ICLD, and ICC data are cue codes derived during the BCC coding of the original 

5 auditory scene. 

1 47. (previously presented) The invention of claim 45, wherein the configuration is adapted 

2 to apply different ICTD, ICLD, and ICC data to different frequency sub-bands of the corresponding FD 

3 signals. 

1 48-50. (canceled) 

1 51. (previously presented) Apparatus for synthesizing an auditory scene, comprising: 

2 a configuration of at least one time domain to frequency domain (TD-FD) converter and a 

3 plurality of filters, the configuration adapted to generate two or more processed FD input signals and two 

4 or more diffuse FD signals from at least one TD input channel; 

5 two or more combiners adapted to combine the two or more diffuse FD signals with the two or 

6 more processed FD input signals to generate a plurality of synthesized FD signals; and 

7 two or more frequency domain to time domain (FD-TD) converters adapted to convert the 

8 synthesized FD signals into a plurality of TD output channels for the auditory scene, wherein: 

9 the apparatus is adapted to generate, combine, and convert for TD input channel 

10 frequencies less than a specified threshold frequency; and 

1 1 the apparatus is further adapted to apply alternative auditory scene synthesis processing 

12 for TD input channel frequencies greater than the specified threshold frequency. 
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52. (previously presented) The invention of claim 51, wherein the alternative auditory scene 
synthesis processing involves coherence-based BCC coding without the filters that are applied to the TD 
input channel frequencies less than the specified threshold frequency. 



1 53. (new) Apparatus for synthesizing an auditory scene, comprising: 

2 a configuration of at least one time domain to frequency domain (TD-FD) converter and a 

3 plurality of filters, the configuration adapted to generate two or more processed FD input signals and two 

4 or more diffuse FD signals from at least one TD input channel; 

5 two or more combiners adapted to combine the two or more diffuse FD signals with the two or 

6 more processed FD input signals to generate a plurality of synthesized FD signals; and 

7 two or more frequency domain to time domain (FD-TD) converters adapted to convert the 

8 synthesized FD signals into a plurality of TD output channels for the auditory scene, wherein: 

9 the configuration comprises: 

10 a first TD-FD converter adapted to convert the at least one TD input channel into 

11 a plurality of FD input signals; 

12 a plurality of delay nodes adapted to delay the FD input signals to generate a 

13 plurality of delayed FD signals; and 

14 a plurality of multipliers adapted to scale the delayed FD signals to generate a 

1 5 plurality of scaled, delayed FD signals; 

16 the combiners are adapted to sum, for each output channel, one of the scaled, delayed FD 

17 signals and a corresponding one of the diffuse FD signals to generate one of the synthesized FD signals; 

18 each filter is a TD late reverberation filter adapted to generate a different TD diffuse 

19 channel from the at least one TD input channel; and 

20 the configuration comprises, for each output channel in the auditory scene: 

21 another TD-FD converter adapted to convert a corresponding TD diffuse channel 

22 into an FD diffuse signal; and 

23 an other multiplier adapted to scale the FD diffuse signal to generate a scaled FD 

24 diffuse signal, wherein a corresponding combiner is adapted to combine the scaled FD diffuse signal with 

25 a corresponding one of the scaled, delayed FD signals to generate one of the synthesized FD signals. 

1 54. (new) The invention of claim 53, wherein: 

2 each other multiplier is adapted to scale the FD diffuse signal based on ICLD and ICC data; 

3 the at least one input channel is at least one combined channel generated by performing BCC 

4 coding on an original auditory scene; and 
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5 the ICLD and ICC data are cue codes derived during the BCC coding of the original auditory 

6 scene. 

1 55. (new) The invention of claim 54, wherein the configuration applies different ICLD and 

2 ICC data to different frequency sub-bands of the corresponding FD signals. 

1 56. (new) Apparatus for synthesizing an auditory scene, comprising: 

2 a configuration of at least one time domain to frequency domain (TD-FD) converter and a 

3 plurality of filters, the configuration adapted to generate two or more processed FD input signals and two 

4 or more diffuse FD signals from at least one TD input channel; 

5 two or more combiners adapted to combine the two or more diffuse FD signals with the two or 

6 more processed FD input signals to generate a plurality of synthesized FD signals; and 

7 two or more frequency domain to time domain (FD-TD) converters adapted to convert the 

8 synthesized FD signals into a plurality of TD output channels for the auditory scene, wherein: 

9 the configuration comprises: 

10 a first TD-FD converter adapted to convert the at least one TD input channel into 

11 a plurality of FD input signals; 

12 a plurality of delay nodes adapted to delay the FD input signals to generate a 

1 3 plurality of delayed FD signals; and 

14 a plurality of multipliers adapted to scale the delayed FD signals to generate a 

15 plurality of scaled, delayed FD signals; 

16 the combiners are adapted to sum, for each output channel, one of the scaled, delayed FD 

17 signals and a corresponding one of the diffuse FD signals to generate one of the synthesized FD signals; 

18 each filter is an FD late reverberation filter adapted to generate a different FD diffuse 

19 signal from one of the FD input signals; and 

20 the configuration further comprises a further plurality of multipliers adapted to scale the 

21 FD diffuse signals to generate a plurality of scaled FD diffuse signals, wherein the combiners are adapted 

22 to combine the scaled FD diffuse signals with the scaled, delayed FD signals to generate the synthesized 

23 FD signals. 

1 57. (new) The invention of claim 56, wherein at least two FD late reverberation filters have 

2 different filter lengths. 
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1 58. (new) The invention of claim 56, wherein: 

2 the FD diffuse signals are scaled based on ICLD and ICC data; 

3 the at least one input channel is at least one combined channel generated by performing BCC 

4 coding on an original auditory scene; and 

5 the ICLD and ICC data are cue codes derived during the BCC coding of the original auditory 

6 scene. 

1 59. (new) The invention of claim 58, wherein the configuration applies different ICLD and 

2 ICC data to different frequency sub-bands of the corresponding FD signals. 

1 60. (new) Apparatus for synthesizing an auditory scene, comprising: 

2 a configuration of at least one time domain to frequency domain (TD-FD) converter and a 

3 plurality of filters, the configuration adapted to generate two or more processed FD input signals and two 

4 or more diffuse FD signals from at least one TD input channel; 

5 two or more combiners adapted to combine the two or more diffuse FD signals with the two or 

6 more processed FD input signals to generate a plurality of synthesized FD signals; and 

7 two or more frequency domain to time domain (FD-TD) converters adapted to convert the 

8 synthesized FD signals into a plurality of TD output channels for the auditory scene, wherein: 

9 the configuration comprises: 

10 a first TD-FD converter adapted to convert the at least one TD input channel into 

11 a plurality of FD input signals; 

12 a plurality of delay nodes adapted to delay the FD input signals to generate a 

13 plurality of delayed FD signals; and 

14 a plurality of multipliers adapted to scale the delayed FD signals to generate a 

15 plurality of scaled, delayed FD signals; 

16 the combiners are adapted to sum, for each output channel, one of the scaled, delayed FD 

17 signals and a corresponding one of the diffuse FD signals to generate one of the synthesized FD signals; 

18 and 

19 the apparatus comprises one filter for every output channel in the auditory scene. 

1 61. (new) Apparatus for synthesizing an auditory scene, comprising: 

2 a configuration of at least one time domain to frequency domain (TD-FD) converter and a 

3 plurality of filters, the configuration adapted to generate two or more processed FD input signals and two 

4 or more diffuse FD signals from at least one TD input channel; 
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5 two or more combiners adapted to combine the two or more diffuse FD signals with the two or 

6 more processed FD input signals to generate a plurality of synthesized FD signals; and 

7 two or more frequency domain to time domain (FD-TD) converters adapted to convert the 

8 synthesized FD signals into a plurality of TD output channels for the auditory scene, wherein: 

9 the configuration comprises: 

10 a first TD-FD converter adapted to convert the at least one TD input channel into 

11 a plurality of FD input signals; 

12 a plurality of delay nodes adapted to delay the FD input signals to generate a 

13 plurality of delayed FD signals; and 

14 a plurality of multipliers adapted to scale the delayed FD signals to generate a 

15 plurality of scaled, delayed FD signals; 

16 the combiners are adapted to sum, for each output channel, one of the scaled, delayed FD 

17 signals and a corresponding one of the diffuse FD signals to generate one of the synthesized FD signals; 

18 each filter has a random frequency response with a flat spectral envelope. 
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